About
This report is intended to act as a first-pass exploratory analysis
of the variants in your dataset. Looking for points that are
e.g. extreme outliers in the PCA plot and the depth/missingness plot are
a good way to identify potentially problematic (or interesting!)
samples. Several basic analysis are included, many of which are colored
by the result of a k-means clustering on the first two principal
components. A map is possible to generate if you place a tab-separated
file with sample ID, longitude, and latitude (decimal degrees) with the
extension .coords in the directory the contains all of the
QC output. By default in this pipeline, these are in the subdirectory
06_QC. By default, three populations are assumed for
k-means clustering, which may be a sensible starting point for visual
inspection of your results, but this value can be changed in the config
file.